Method and apparatus for processing voice data

ABSTRACT

One aspect of the present disclosure is a device for processing voice data associated with an application program. The application program has a form therein for entering voice data. The application program provides an identifier associated with voice data for identifying the form the voice data is associated with. The device includes an input portion for receiving voice data and identifier information from the application program. Also included is a translation portion for translating voice data to text data. The translation portion receives the voice data from the input portion and translates the voice data to text data. Also included is an output portion for sending output text data and the identifier information to the application program for association with the form.

BACKGROUND OF THE INVENTION

[0001] The present invention relates to a method and apparatus forprocessing voice data for use in a variety of voice applications such asconversion of voice data into text data.

[0002] There is an ever-increasing need for improved productivity in awide variety of professions. In those professions where productivity isparticularly crucial, many professionals make use of techniques forconverting voice data into text data. By relieving the professionals ofhaving to enter keystrokes using a keyboard, these professionals areable to devote their time and energy to higher valued uses. In addition,freeing these professionals from the keyboard is important in manyprofessions that require significant traveling or fieldwork.

[0003] Historically, the conversion of voice data to text data has beenaccomplished using a manual process. This process requires theprofessional to dictate, sometimes using a small recorder to capturetheir voice to tape or to dictate directly to a skilled assistant. Theskilled assistant then transcribes or types entering this informationinto a word processing system to convert the voice information either ontape or directly spoken to the assistant to create this text data ortext document.

[0004] More recently, speech to text applications have evolved, whichconvert spoken language into text form which is then made accessible tothe user in some form of a word processing system. Some examples of theimplementation of voice recognition technology are Dragon™ (a product ofScansoft) and ViaVoice™ (a product of International Business MachinesCorporation (IBM)).

[0005] Each of these techniques for converting voice data to text datahas strengths and weaknesses. Manual transcription can be very accuratebut tends to be expensive. In contrast, the speech to text applicationstend to be relatively inexpensive but this technique tends to be lessaccurate than manual transcription.

[0006] The conversion of speech to text is used in a wide variety ofapplications. Some of these applications include medical applicationswhere physicians must document examinations for ongoing clinical care.Because of the large number of patients, a typical doctor is required tosee everyday, physicians frequently dictate notes regarding patientcare. In addition, physicians frequently dictate e-mails to otherphysicians as well as e-mails to patients. Another application whereconversion of speech to text is frequently used is in legalapplications. Frequently, lawyers dictate letters and memos as well assubstantive legal documents, such as agreements and legal briefs to savetime.

[0007] There is an ever present need for systems to handle theconversion of voice data to text data in a flexible manner that allowsuse of a technique for converting voice data into text data that is wellsuited for the particular application. In addition, these systems forconverting voice data to text data should be cost effective and seamlessfor the user so as to provide ease of use. Finally, these systems shouldbe capable of providing conversion of speech data to text data in arapid fashion, when required by the application.

SUMMARY OF THE INVENTION

[0008] One aspect of the exemplary embodiment of the present inventionis a device for processing voice data associated with an applicationprogram. The application program has a form therein for entering voicedata. The application program provides an identifier associated withvoice data for identifying the form the voice data is associated with.The device includes an input portion for receiving voice data andidentifier information from the application program. Also included is atranslation portion for translating voice data to text data. Thetranslation portion receives the voice data from the input portion andtranslates the voice data to text data. Also included is an outputportion for sending output text data and the identifier information tothe application program for association with the form

[0009] Another aspect of the present invention is a system forprocessing voice data. The system includes a processing device runningan application program. The processing device has an input device forreceiving a voice signal and converting that voice signal into voicedata. Included, is an application processing device configured forInternet connection. The application processing device has a data linkfor receiving voice data in real time from the processing device. Alsoincluded is a voice processing device configured for Internet connectionwhereby voice data is passed in real time from the applicationprocessing device to the voice processing device.

[0010] Yet another aspect of the present invention is a device forcollecting data. The device includes a display portion for displaying aform having an insertion portion for inserting information therein. Theinsertion portion is selectable for capturing voice data from an inputdevice. Also included is a control portion for tagging the voice datawith an identifier for associating the voice data to the form.

BRIEF DESCRIPTION OF THE DRAWINGS

[0011]FIG. 1 is a block diagram representing an exemplary system of thepresent invention for processing voice data.

[0012]FIG. 2 is a simplified block diagram of a programmable processorof the type used in various components of the exemplary embodiment shownin FIG. 1.

[0013]FIG. 3 is a simplified flow diagram representing a method of theexemplary embodiment for passing voice data from an application to avoice processing device shown in FIG. 1.

[0014]FIG. 4 is a simplified representation of command information,voice data, and identification information that is wrapped in anenvelope provided by an application shown in FIG. 1.

[0015]FIG. 5 is a simplified representation of a header that includesdestination address, source address, destination port, and source portused to post information by the application processing device shown inFIG. 1.

[0016]FIG. 6 is a simplified flow diagram representing the flow of textdata from the voice processing device to the application, whichassociates the text data with the form

[0017]FIG. 7 is a simplified representation of the command information,identification information and text data provided to the application bythe application processing device shown in FIG. 1.

[0018]FIG. 8 depicts an exemplary form that is displayed by theapplication to illustrate the insertion of the text data into the form.

DETAILED DESCRIPTION OF THE EXEMPLARY EMBODIMENT

[0019]FIG. 1 represents a system 10 of an exemplary embodiment of thepresent invention for processing voice data. The system 10 includes anapplication 12 for generating voice data that is associated with a form.The voice data is provided to an application processing device 14, whichin turn passes the voice data to a voice processing device 16 via a datalink 18. The voice processing device 16 passes the voice data to atranscriptor 20 that converts the voice data into text data. The textdata is then returned to the voice processing device 16, which thenpasses the text data to the application processing device 14 via thedata link 18. The application processing device 14 provides the textdata back to the application 12 whereupon the text data is associatedwith the form.

[0020] The method and apparatus of the exemplary embodiment of thepresent invention 10 provides a system for converting voice data intotext data in a manner that is highly flexible, scalable and seamless tothe user of the application 12. In one exemplary embodiment, a pluralityof applications 12 are supported by each application processing device14. In addition, the voice processing device 16 is capable of supportinga plurality of application processing devices 14. By increasing theapplication processing devices 14 and applications 12, the system 10 canbe configured to support a wide variety of different requirements.

[0021] Another aspect of the exemplary embodiment, is that voice data isentered directly into a form, as will be discussed in more detail withrespect to FIG. 8. Entering voice data into a form, makes use ofestablished workflow as defined by the forms. In addition, entering formdata using dictation is highly efficient. For example, a physicianentering data into a patient's chart subsequent to examination of thepatient. The physician examines the patient's chart using theapplication 12 whereupon the physician selects portions of the chart anddictates or speaks into the input device associated with the application12 to capture voice data. This voice data is assigned identificationinformation for associating the voice data with the particular portionof the patient's chart that voice data was intended. As will bediscussed, the text data, which is returned from the voice processingdevice 16, is packaged with the identification information so that theapplication 12 can insert the text in the proper position within thepatient's chart. This technique provides great ease of use for the userof application 12, thereby improving efficiency and reducing trainingtime and training costs.

[0022] In the exemplary embodiment, application 12 is a programmabledevice that executes software or firmware that is stored in memory. FIG.2 represents a simplified programmable processing device 22 such as usedfor application 12. The programmable processing device 22 includes acentral processing unit 24 for processing instructions, which are storedin a memory 26. The central processing unit sends and receivesinformation from the programmable processing device 22 via an inputoutput portion 28. A mass storage 30 is provided for storing data andother information.

[0023] In one exemplary embodiment, the application 12 is a personalcomputer or handheld processing device such as a Personal DigitalAssistant (PDA). Two examples of a PDA include the IPAQ or the Jomada,each manufactured by the Hewlett-Packard Company in Palo Ato, Calif. ThePDA typically includes a display, a microphone for inputting voice dataand a linking device such as a wireless networking device fortransferring information from the application 12 to the applicationprocessing device 14 in a wireless manner. One such wirelessconnectivity devices is a radio frequency transmission device, whichtransmits data using the 80211B protocol commonly, referred to WI-FL.Alternatively, the linking device can be a variety of devices forexchanging information between the application 12 and the applicationprocessing device 14 such as an internet data connection or satellitelink for exchanging information.

[0024] The application processing device 14 in the exemplary embodimentis a programmable processing device 22 similar in basic structure to theprogrammable device shown in FIG. 2. The application processing device14 executes software or firmware that is stored on the applicationprocessing device 14. The application processing device 14 in additionto exchanging information with the application 12 also exchangesinformation with the voice processing device 16. In the exemplaryembodiment, information is exchanged between the voice processing device16 and the application processing device 14 by way of an Internetconnection. In this exemplary embodiment, each of the applicationprocessing device 14 and the voice processing device 16 make use of aprotocol for exchanging information in a decentralized, distributedenvironment. In this exemplary embodiment, a Simple Object AccessProtocol (SOAP) is used for the exchange of information.

[0025] The voice processing device 16 in the exemplary embodiment is aprogrammable processing device 22 that is similar in basic structure tothe programmable device shown in FIG. 2. The voice processing device 16is controlled by software stored on the programmable device, whichimplements the exchange of information with the application processingdevice 14 using a protocol such as the SOAP protocol. The applicationprocessing device 14 and voice processing device 16 can, alternatively,use other protocols for exchanging information via the Internet 18. Inaddition, the application processing device 14 and the voice processingdevice 16 can use other medium for transmitting information therebetween such as satellite link, telephone line or any other techniquefor transferring data therebetween.

[0026] In the exemplary embodiment, the application processing device 14acts as a client and the voice processing device 16 acts as a server ina client/server relationship. As a server, the voice processing device16 will never initiate a communication with the application processingdevice 14. In this relationship, the application processing device 14must initiate connection with the voice processing device 16 forexchanging information therebetween. Once the transaction is completed,the application processing device 14 terminates communication with thevoice processing device 16. The application processing device 14 thenmust poll the voice processing device 16 periodically to determine ifthe voice processing device 16 has information to transfer to theapplication processing device 14. It is difficult to establishcommunication to an application processing device 14 because of the useof firewalls and proxy servers for security. The use of a client/serverrelationship allows the exchange of information with applicationprocessing devices 14 that have security features such as firewalls. Forthe case where the application processing device 14 does not usesecurity features a client/client relationship can exist between each ofthe voice processing device 16 and application processing device 14whereupon each can initiate communication with the other as soon as thesituation dictates.

[0027] The transcriptor 20 makes use of a wide variety of techniques forconverting speech data into text data. For example, the transcriptor 20in one exemplary embodiment is a manual transcription process. The voiceprocessing device 16 passes the voice data to the transcriptor 20 whothen manually types in the text data while listening to the voice data.The voice data is passed to the transcriptor 20 using one of a varietyof techniques such as an Internet connection, phone line, radiofrequency link or some other technique for passing voice data from thevoice processing device 16 to the transcriptor 20.

[0028] Alternatively, the transcriptor 20 is a speech to textapplication for converting the voice data to text data. This speech totext application can be one of a variety of speech to text applicationssuch as Dragon™ of Via Voice™ previously mentioned. This speech to textapplication can be running on the voice processing device 16 oralternatively, can be run on a programmable processor such as shown inFIG. 2.

[0029] Once the transcriptor 20 converts the voice data to text data,the text data is returned to the voice processing device 16 for postingto the application processing device 14 and ultimately back to theapplication 12. Further details of the technique for passing data fromthe application 12 to the transcriptor 12 and back again will now bediscussed with respect to FIGS. 3 through 6.

[0030]FIG. 3 is a simplified flow diagram depicting the method forpassing voice data from the application 12 to the voice processingdevice 16. The process begins with the application 12 associating voicedata with a form and assigning identification information to that voicedata so that the application 12 can associate that voice data with theparticular speaker and form for which the voice data corresponds. Inaddition, the application 12 assigns command information to the voicedata for specifying instructions for handling the voice data. Thecommand information may specify that the voice data is to be convertedto text data and returned. Other types of commands may include a varietyof types of things such as to send the voice data to a distribution listor have the voice data converted to text data and e-mailed to adistribution list, to name a few. The command information specifies howthe voice data is to be handled. Handling of the voice data can beaccomplished in a wide variety of ways, depending on the capabilities ofthe voice processing device 16. The assigning of the command informationand identification information to the voice data by the application 12is represented by step 32.

[0031]FIG. 4 represents an application envelope of data 34 that isprovided by the application 12 to the application processing device 14.In an exemplary embodiment, the envelope 34 includes voice data 36,identification information 38 and command information 40. This data ispackaged in an envelope 34 that is passed to the application processingdevice 14 as represented by step 40 in FIG. 3. As discussed previouslywith respect to FIG. 1, the envelope information 34 can be passed fromthe application 12 to the application processing device 14 using avariety of techniques for passing data including using a wireless linkor an electrical or optical connection.

[0032] The application processing device 14 sends the envelopeinformation 34 to the voice processing device 16 as represented by step42. The envelope information 34 can be sent in a wide variety of waysincluding an Internet connection, a wireless link, a satelliteconnection, or some form of hardwired connection such as a telephoneconnection. In the exemplary embodiment, the application processingdevice 14 attaches a header 42 and includes this header with a payloadcontaining the envelope information 34. In the exemplary embodiment, aprotocol for transferring information between the application processingdevice 14 and the voice processing device 16 is the SOAP protocol andHTTP extension framework. In the exemplary embodiment, the header 42,includes a destination address 44, a destination port 46, a sourceaddress 48, and a source port 50 as shown in FIG. 5. The destinationaddress and destination port corresponds to a corresponding address andport of the voice processing device 16. The source address and thesource port corresponds to an address and port associated with theapplication processing device 14.

[0033] The voice processing device 16 receives the header 42 containingthe envelope 34, as payload as represented by step 52. The voiceprocessing device 16 acknowledges receipt of the payload therebyallowing the application processing device 14 to disconnect from thevoice processing device 16. There are some inherent delays in thesending of voice data from the application 12 to the applicationprocessing device 14. These delays are due to the billing of input andoutput buffers as well as overhead, required in establishing connectionbetween the application 12 and the application processing device 14.Once the application processing device 14 receives data and establishesa connection with the voice processing device 16, this voice data can bestreamed in a real time manner from the application processing device 14to the voice processing device 16. Therefore, it is possible for alonger dictation or relatively large voice data files to have aconnection between the application 12 and application processing device14 as well as simultaneously between the application processing device14 and the voice processing device 16 as data is passed from theapplication 12 to the voice processing device 16. Because the flow ofdata between the application processing device 14 and the voiceprocessing device 16 for this interconnect connection is typicallysufficiently high bandwidth to support continuous speech. Thisconnection will be referred to herein as “real time.”

[0034] Once the voice processing device 16 receives the voice data fromthe application 12, the voice processing device 16 performs thosefunctions specified by the command information 40 as represented by step54 in FIG. 3. As discussed previously, the voice processing device 16may be capable of providing a wide variety of functions in addition toconverting voice data into text data. Once the voice data is convertedinto text data, the voice processing device 16 updates status to reflexthat this command has been completed. As discussed previously, theconversion of voice data to text data can be accomplished by the voiceprocessing device 16 itself or this function can be off-loaded to thetranscriptor 20.

[0035]FIG. 6 represents the method of transferring text data from thevoice processing device 16 back to the application 12. As discussedpreviously, in the exemplary embodiment, the application processingdevice 14 acts as a client and the voice processing device 16 acts as aserver in a server client relationship. However, in the case where thevoice processing device 16 acts as a client, the voice processing device16 can initiate connection with the application processing device 14when the text data is ready. However, in the case where the voiceprocessing device 16 is acting as a server, then text data cannot bepassed from the voice processing device 16 to the application processingdevice 14 until the application processing device 14 polls the voiceprocessing device 16 and determines from the status that text data isready as represented by step 56. Once text data is ready, theapplication processing device 14 makes a request for the text data fromthe voice processing device 16 as represented by step 58.

[0036] The voice processing device 16 then prepares an envelope 60 andheader for posting as represented by step 68. The envelope informationshown in FIG. 7 is similar to the envelope 60 sent previously by theapplication processing device 14 shown in FIG. 4 except that the voicedata 36 has been replaced by text data 66. The envelope 60 containsidentification information 64 so that the application 12 can associatethe text data 66 with the proper form, also included in the envelope 60,as command information 62.

[0037] Prior to sending the envelope 60 from the voice processing device16 to the application processing device 14, the voice processing device16 prepares a header that is similar to the header used by theapplication processing device 14 to send information to the voiceprocessing device 16 except that the destination address 44 and thesource address 48 are swapped and the destination port 46 and the sourceport 50 are swapped. The voice processing device 16 posts the header andpayload 60 as represented by step 68. In response to posting, theapplication processing device 14 receives the payload as represented bystep 70 and acknowledges receipt.

[0038] The application processing device 14 passes the payload orenvelope information, which includes text data, identifier information,and command information to the application 12 as represented by step 71.Once the application 12 receives the text data 66, identifierinformation 64, and command information 62, the application can thenassociate the text data to a particular form by using the identificationinformation 64. The identification information 64 is used to tag voicedata which leaves the application 12 and that tag data remains with thevoice data even after this data is converted to text data so that theapplication 12 can properly associate the text data to the proper form.

[0039] In operation, the exemplary embodiment of the method andapparatus of the present invention will now be described with respect toa specific application that relates to medical dictation. In thisapplication, the application 12 is a PDA that is operating a medicalrecords application. It should be noted that a wide variety of otherapplications are also suitable for using the method and apparatus of thepresent invention. This exemplary application shown in FIG. 8 representsa patient's chart that the physician fills in after examining thepatient. In this exemplary embodiment, the patient's chart is displayedon the display 72 of the PDA. The physician then selects a portion ofthe form in which they desire to enter voice data as represented by box74. The physician dictates or speaks into a microphone associated withthe PDA to capture voice data Application software running on the PDAcaptures this voice data and attaches identification information forassociating this voice data with this particular form. In this exemplaryembodiment, the voice data is associated with Angela Abrams' patientchart under the basic form, six month checkup as well as the physiciandictating the notes is identified so that the voice data is associatedwith the proper location and form.

[0040] This voice data is then sent from the application 12 to theapplication processing device 14 and then to the voice processing device16. In the exemplary embodiment, the PDA is equipped with a wirelessnetworking card for transferring this information in a wireless fashionto the application processing device 14.

[0041] The voice processing device 16 receives this voice data andcommand information and determines that a conversion from voice data totext is requested by the application 12. The voice processing device 16sends the voice data to the transcriptor 20 for transcription. Oncetranscription is complete, the voice processing device 16 receives textdata, which corresponds to the original voice data. This text data issent from the voice processing device 16 to the application processingdevice 14 then on to the application 12 for the physician's PDA.

[0042] The display 72 associated with the physician's PDA in thisexemplary embodiment shows that there are two dictations, each byPhysician S. Lance. A first dictation performed on May 3, 2002, at 12:50p.m. is represented by reference number 76 and a second dictation oneminute later as represented by reference number 78. Upon referencenumber 78, the text data is displayed in a portion 74 for entering data.In addition, the voice data or dictation by the physician is played, asillustrated by the playback control 80 showing progress of the voicedata as the actual voice data is played through a speaker. The physiciancan check the text data shown in entry portion 74 with the voice datathat is played through the speaker to check for accuracy of the textdata. The physician can edit the text data if there are any mistakes.Once the text data is approved by the physician, the physician canindicate that the text is approved by activating portion 82 on the touchscreen. Once the text is approved, an icon 84 appears next to thecorresponding dictation to show that the text has been approved by theentering physician.

[0043] The method and apparatus of the exemplary embodiment of thepresent invention provides a technique for capturing voice data andtagging this voice data or associating the voice data with a particularform. The voice data is then uploaded to a voice processing device 16for converting the voice data to text. The text data is then routed backand associated with the particular form, freeing the user from having tomanually enter data into a form. This technique is well suited to a widevariety of professions where it is cost effective to free theprofessional from the data entry task. In addition, the technique of thepresent invention is well suited for applications where the data entryinto forms is inconvenient such as various kinds of inspectors who areout at a remote site and don't have access to a keyboard for data entry.In this case, the voice data is captured, and once the user reaches asuitable location for uploading information to the applicationprocessing device 14, then this voice data is uploaded.

[0044] Finally, the technique of the present invention makes use offorms, which are currently in use. These forms are integrated into theapplication software of the application 12 and data can be entered. Bymaking use of current forms, the workflow for a particular applicationdoes not need to be modified, thereby minimizing transition costs frommanual entry of forms to this direct voice capture.

What is claimed is: 1) A method for processing voice data provided by anapplication program, the application program having a form associatedtherein for inserting text, the method comprising: receiving voice dataassociated with the form within the application program, the voice datahaving an identifier associated therewith for identifying the form;translating the voice data to corresponding text data; and providing thetext data and the identifier to the application program for associatingthe text data with the form. 2) The method of claim 1 wherein the voicedata received includes an identifier for identifying the speaker of thevoice data and wherein the text data is provided to the applicationprogram with the identifier. 3) The method of claim 1 wherein prior toreceiving voice data further including providing a header that isassociated with the voice data for identifying a source address and portfor an application processing device linked to the application programand identifying a destination address and port for a voice processingdevice linked to a voice processing device for translating voice data totext data. 4) The method of claim 3 wherein the voice data is sent tothe application processing device in real time. 5) The method of claim 1wherein prior to providing the text data further including providing aheader that is associated with the text data for identifying a hostaddress and port for an application processing device linked to theapplication program and identifying a source address and port for avoice processing device linked to a voice processing device fortranslating voice data to text data. 6) The method of claim 1 whereinthe voice data is provided by a PDA that is linked using a wireless linkto an application processing device, the application processing deviceprovides the voice data and identified in real time via an internetlink. 7) The method of claim 1 wherein prior to receiving voice dataassociated with the form performing the steps of: displaying a formhaving a request for information therein, the request for informationhaving a user selectable portion; selecting the user selectable portionand speaking the requested information wherein the requested informationis captured as voice data; and tagging the voice data with an identifierfor associating the voice data with the form. 8) A device for processingvoice data associated with an application program, the applicationprogram having a form therein for entering voice data, the applicationprogram providing an identifier associated with voice data foridentifying the form the voice data is associated with, the devicecomprising: an input portion for receiving voice data and identifierinformation from the application program; a translation portion fortranslating voice data to text data, the translation portion receivingthe voice data from the input portion and translating the voice data totext data; and an output potion for sending output text data and theidentifier information to the application program for association withthe form. 9) The device of claim 8 wherein the input portion and theoutput portion are a voice processing device that is linked to thetranslation portion and wherein the voice data and text data areexchanged between the application program and the voice processingdevice via an internet link. 10) The device of claim 8 further includinga portable processing device for executing the application programwherein the portable processing device sends voice data in real time viaa wireless link to an application processing device and wherein theapplication sends the voice data and identification information to theinput portion via an internet connection. 11) A method for processingvoice data comprising: providing a stream of voice data to anapplication processing device, the application processing deviceestablishing a connection to a voice processing device upon receipt ofvoice data whereby voice data is passed in real time to the voiceprocessing device; disconnecting the application processing device fromthe voice processing device upon termination of the stream of voice andpolling the voice processing device for completion of translation of thevoice data into text data; and requesting the text data for posting bythe voice processing device upon completion of translation of the voicedata into text data. 12) A system for processing voice data comprising:a processing device running an application program, the processingdevice having an input device for receiving a voice signal andconverting that voice signal into voice data; an application processingdevice configured for internet connection, the application processingdevice having a data link for receiving voice data in real time from theprocessing device; and a voice processing device configured for Internetconnection whereby voice data is passed in real time from theapplication processing device to the voice processing device. 13) Thesystem for processing voice data of claim 12 wherein the voiceprocessing device is configured for connection to transcriptionresources for conversion of voice data into text data, wherein the voiceprocessing device is configured to receive text data from thetranscription resources. 14) The system for processing voice data ofclaim 13 wherein the voice processing device is configured for sendingthe text data to the application processing device, the server passesthe text data to the processing device. 15) The system for processingvoice data of claim 12 wherein the processing device is a personal dataassistant. 16) A computer-readable storage medium storing a computerprogram product that, when run in a voice-processing platform forprocessing voice data, the stored program executing the steps of:receiving a header and envelope information from an internet connectedvoice data capture device running an application program, the headercontaining a destination address and a destination port for the voiceprocessing system, the header further including a source address and asource port for the voice data capture device, the envelope informationcontaining voice data, an application identification for the applicationprogram to identify context of the voice data and command informationfor specifying processing of the voice data by the voice processingplatform; processing the voice data as specified by the commandinformation to convert the voice data into text data, the text data iscombined with the application identification to form revised envelopeinformation and a revised header containing a destination address and adestination port for the voice data capture device, the header furtherincluding a source address and a source port for the voice processingsystem platform; and posting the revised envelope using the revisedheader. 17) The computer-readable storage medium of claim 16 whereinposting the revised envelope is performed after receipt of a requestfrom the Internet connected voice data capture device. 18) A method forcollecting data comprising: providing a form having a request forinformation therein, the request for information having a userselectable portion that when selected captures voice data from an inputdevice; and tagging the voice data with an identifier for associatingthe voice data to the form. 19) The method for collecting data of claim18 further comprising converting the voice data to text data andprovides an icon that is displayed when the form is viewed. 20) Themethod for collecting data of claim 19 further comprising providing anicon that when selected provides an icon that the text data has beenapproved. 21) A device for collecting data comprising: a display portionfor displaying a form having an insertion portion for insertinginformation therein, the insertion portion is selectable for capturingvoice data from an input device; and a control portion for tagging thevoice data with an identifier for associating the voice data to theform.